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Production of Proteins 
Field of the invention 

5 The present invention relates to the production of 

proteins, more particularly to a method for improving the 
secretion of proteins from eukaryotic" cell's 

Background of the invention 

10 

It is well known that the genetic modification of organisms 
can be used to produce proteins or fragments thereof. 
Generally, proteins which are produced in eukaryotic cells- 
. are initially located within intracellular organelles. For 
15 example proteins may initially be located in the : 
endoplasmatic reticulum. 

For many applications it is preferred that proteins, 
especially heterologous proteins, are adequately secreted , 
20 extracellularly . For this purpose it is necessary that they 
can pass the cell plasma membrane in reasonable amounts and 
without substantial loss of protein activity. 

A problem in the production of heterologous proteins in 
25 lower eukaryotic cells is that for some proteins, 

especially proteins with relatively large hydrophobic areas 
on their surface, the secretion is inefficient, in 
.particular such proteins seem to have, a low flux through 
. the secretion system. 
30 . 

For example Frenken L.G.J. et al {19 94) discloses that 
hydrophobic antibody fragments tend to accumulate in the 
endoplasmatic reticulum. Furthermore, Sagt , C.J.M. et al 
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(1998) discloses that in Saccharomyces cerevisiae, some 

cutinases with mutations to introduce hydrophobicity are 
secreted in significantly lower amounts than wild-type 
cutinase. 

5 

It has been suggested to improve secretion of heterologous 
proteins produced in a genetically modified organism by the 
introduction of a glycosilation site in the heterologous 
protein. 

10 

EP-A-704,527 relates to a process for the preparation of 
insulin. This document discloses that the introduction of 
N- glycosilation consensus site : to a spacer region, which 
is not part of mature insulin molecule, remarkably 
15 increases expression in fungal .cells, which are transformed 
with DNA sequences encoding such insulin precursors. 

EP-A-394,951 discloses the introduction of a glycosilation 
site in fibroblast growth factor in order to induce 
20 secretion of this factor from transformed cells. 

WO-A-96/05228 , which is herewith incorporated by reference, 
relates to a method to produce single -chain Fv molecules in 
eukaryotic cells . This document discloses that 

25 glycosilation of single-chain Fv molecules can enhance the 
rate of secretion. For example an N-linked glycosilation 
site is introduced at position 19 in FR1 of V H of single 
chain Fv molecule against human TfR (human transferrin 
receptor) and in FR1 of V H of single chain Fv molecule 

30 against hapten DNP (Kurucz et al 1993) with an N linked 
glycosilation site at position 19 in FR1 of V H . More 
specifically for example an N linked glycosilation site is 
introduced at position 19 of V H of single chain Fv U7.6Ab- 
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sFv or OKT9 Ab-sFv. These proteins are excluded from the 
scope of the current invention. 

It has been found however that glycosilation at an 
5 arbitrary position in the protein often does not lead to 
the desired increased secretion of a protein, especially 
" " not for secretion of proteins comprising a hydrophobic 
region . 

10 Summary of the invention 

Surprisingly applicants have .found that especially, good 
results are obtained if the location of the glycosylation , 
group or groups is carefully chosen. 

15 In particular it has been found that proteins with : 
relatively large hydrophobic areas on their surface, which 
normally show a low secretion flux through the secretion; , 
system, can be secreted in significantly higher amounts if 
they are brought into glycosylated form, preferably N- 

20 glycosylated form, wherein the location of the 

glycosylation groups is chosen such that the hydrophobic 
stretches in the protein are shielded. On the other hand 
the location of the glycosylation groups is chosen .such 
that they do not materially affect the functionality of the 

25 protein e.g. through influencing the binding affinity -of 
the binding site and/or .active site" of the protein. 

Accordingly in a first aspect the present invention relates 
to a method for producing a protein, preferably a 
30 heterologous protein, in a lower eukaryotic cell, followed 
by secretion of said protein from said cell, said protein 
comprising at least one hydrophobic stretch, wherein the 
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protein is glycosylated such as to increase the level of 
secretion of the protein, without substantially affecting 
the functionality of the protein and wherein at least one 
N-glycosylation group is located between the N-terminus of 
5 the protein and the first hydrophobic stretch and with the 
proviso that 

a) the protein is not a single chain Fv molecule against 
human TfR with an N linked glycosilation site at 
position 19 in FR1 of V H , and 
10b) the protein is not_a single chain Fy molecule against 
hapten DNP with an N linked glycosilation site at 
position 19 in FR1 of V H . 

t The invention is especially advantageous for improving the 
15 secretion of heterologous proteins . In a preferred 

embodiment of the invention, the protein can advantageously 
; be de- glycosylated after secretion. 

Detailed description of the invention 

20 

In the specification and claims the following terms and 
abbreviations are used. 

As used herein, "eukaryotic cell" means a cell which 
25 comprises a nucleus containing the genetic material, 
surrounded by a cytoplasm which in turn is encompassed 
within the plasma membrane which marks the periphery of the 
cell. 
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A "gene" is a DNA sequence encoding a protein, including 
modified or synthetic DNA sequences or naturally occurring 
sequences encoding an RNA molecule, peptide, polypeptide, or 
protein and regions flanking the coding sequence involved in 
5 the regulation of expression. 

A ^hydrophobic stretch" of a protein is a sequence of amino 
acids in the protein that repel water. A quantitative 
definition of the term "hydrophobic stretch" is given in 

10 Blond-Elguindi, S. et al (1993). In this document a 
hydrophobic stretch can be identified by defining the 
Binding Protein (BiP) score of secreted proteins. This score 
consists of data generated with a peptide scan which 
determines the value of the BiP affinity for every 7 amino,, 

15 acids stretches of the protein. A value is given to the - 
central amino acid of the 7 amino acids stretch. When; the;., 
stretch contains hydrophobic amino acids at specific places 
the BiP is further increased. A total BiP score of 10 or 
more for a heptapeptide indicates a hydrophobic stretch. n 

20 Hydrophobic amino acids are alanine, , valine, . leucine, 
isoleucine, proline, phenylalanine, tryptophan and ... 
tyrosine, methionine, and histidine. Preferred are valine, 
leucine, isoleucine, phenylalanine, tryptophan and 
methionine. 

25 

A "hydrophobically modified protein" is a protein which has 
an aggregated BiP score which is at least 50 units, .more 
preferably at least 80 units, above the aggregated BiP score 
of the non-modified protein. The aggregated BiP score can be 
30 calculated by determining for each heptapeptide in the 
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protein an individual BiP score, in accordance to Blond- 
'Elguindi, S. et al (1993). The aggregated BiP score can then 
be determined by calculating the sum of all individual BiP 
scores - 

5 

A "homologous protein" is a protein which is expressed in 
untransf ormed cells. 

A "heterologous protein" is a protein which is not 
10 expressed in untransf ormed cells but which, under 

application of genetic engineering may be expressed in 
transformed cells. 

A "ribosome" is a particle composed of ribosomal RNAs and" 
15 ribosomal proteins that * associate with messenger RNA and 
catalyse the synthesis of protein. 

The "cytoplasm" is the contents of a cell that are 
contained within its plasma membrane but, in the case of 
20 eukaryotic cells, outside the nucleus. 

The "cytosol" is the content of the main compartment of the 
cytoplasm, excluding the membrane bound organelles such as 
the endoplasmic reticulum and the mitochondria. Originall 
25 defined operationally as the cell fraction remaining after 
membranes, cytoskeletal components, and other organelles 
have been removed by low-speed centrif ugation. 

The "mRNA" (messenger RNA) is an RNA molecule that - 
30 specifies the amino acid sequence of a protein. Produced by 
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RNA splicing from a larger RNA molecule made by RNA 
polymerase as a complementary copy of DNA. It is translated 
into protein by a process catalysed by ribosomes. 

5 "Translation" is the process by which the sequence of 
nucleotides in a messenger RNA molecule directs the 
incorporation of amino acids into protein; occurs on a 
ribosome . 

10 "N-terminus" (amino terminus) is the end of a polypeptide 
chain that carries a. free alpha-amino group. 

. The present invention relates to an improved secretion of 
i; . proteins , .-in particular heterologous proteins, ; from lower 
.15 eukaryotic cells,, . said proteins comprising at least one. 
hydrophobic stretch and/or said proteins being 
. hydrophobically modified.. 

For. the purpose of the invention the term protein is^ used 
20 to embrace proteins or fragments thereof, said fragments 
preferably comprising at least 10 amino acid residues, more 
preferred at least 50 amino acids, most preferred . from 100 
to 2, 000- amino acids. , . 

25 For the purpose of the invention, the. protein is not a 

single chain Fv molecule against human TfR with an N linked 
glycosilation site at position 19 in FR1 of V H/ . and the 
protein is not a single chain Fv molecule against hapten 
DNP with an. N linked glycosilation site at position 19 in 

30 FR1 of V H - 
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Eukaryotic cells for the purpose of the invention are lower 
eukaryotes such as for example moulds or yeasts- Preferred 
moulds belong to the genera Aspergillus or Trlchoderma. 
5 Preferred yeasts belong to the genera Saccharomyces, 

Kluyveromyces , Hansenula or Plchla. Especially preferred is 
the use of host cells selected from Saccharomyces 
cerevislae, Aspergillus awamori and Plchla pastorls. 

10 The host cells are us ed for the production o_f the proteins, 
especially heterologous proteins . Well-known genetic > . 
manipulation techniques lean be ; used to produce said^ 
heterologous proteins. For example a nucleotide sequence 1 
encoding the desired protein can be. inserted into a 

15 suitable expression vector encoding the necessary elements' 
for transcription and translation and in such manner that r 
the protein will be expressed under appropriate conditions v 
The methods required to construct these expression vectors 
are well known to those skilled in the art. Preferred 

20 vectors stably integrate at pre-defined positions in the 
chromosome. 

The present invention is particularly directed to improve 
the degree of secretion of proteins that normally have a 

25 low level of secretion. Another embodiment of the invention 
is especially preferred to be applied to the production of 
proteins which, under normal culturing conditions, are 
secreted at a level of less than 10 mg/g dry weight. Also 
the invention is especially advantageous for proteins which 

30 after glycosylation have a clearly improved level of 
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secretion, for example more than 1.5 times the secretion as 
compared to the non-glycosylated protein, more preferred 
more than 2.5 times, most preferably the level of secretion 
is improved by at least a factor of 3.5. Especially . 
5 preferable, the level of secretion is more than 50 mg/g dry 
weight, more preferred more than 100 mg/g dry weight, most 
preferred more than 200 mg/g dry weight. 

Also the invention is especially applicable to increase 
10 extracellular secretion of heterologous proteins which are 
not normally glycosylated. Preferably N-glycosylation is 
used. In order to introduce N-glycosylation during the 
production -of a protein a acceptor group is needed which 
can either be Asn-X-Ser or Asn-X-Thr (where X is any amino .,. 
15 acid except proline. 

The molecular .weight of the glycosylation groups is 
generally not critical. 

20 The location of the glycosilation groups is. chosen such 

that the hydrophobic stretches of the molecule are. shielded 
e.g. to reduce the binding affinity of Binding Protein such 
as Bip to said hydrophobic stretches. On the other hand the 
location of the glycosilation groups is chosen such that 

25. they, do not materially affect the functionality of the. 
protein e.g. through influencing the binding affinity of 
.the binding site and/or active site of the protein..,. 

Both effects can be adequately predicted or can be measured 
30 by conventional binding tests. Applicants believe that it 
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will be well within the capability of the skilled person to 
select for each protein the appropriate size and location 
of glycosy.lation groups such as to increase the secretion 
of said protein. 

5 

In particular it is preferred that the glycosylation groups 
are orientated such that they provide a shielding effect- 
for the hydrophobic areas of the protein when the protein . 
is present in linear form in the Endoplasmatic Reticulum 

10 (ER) of the host cell. Additionally it is preferred that 
the glycosylation does not materially affect the binding * 
and/or active site(s) of said protein when the protein is 
present in its folded three dimensional shape outside the - 
production cell . 

15 ' ' - •' •' • • • ' 

Although applicants do not wish to be bound by any theory * : 
it is believed that the beneficial effect of the 
glycosylation groups located at specific locations in the 
protein can be explained as follows. 

20 ' 

In the production of proteins in lower eukaryotic cells, 
ribosomes are believed to bind mRNA molecules in the 
cytosol of the cell where translation of these nucleotide 
sequences will commence. Where a mRNA sequence encodes, at 
25 the 5 end, a signal peptide, the translation of this will 
cause a migration of the translating apparatus to the 
surface of the ER. The signal peptide then becomes embedded 
in the membrane of the ER and therefrom, protein synthesis 
by further translation of the mRNA will be concurrent with 
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the translocation of . the developing protein molecule • into 
the lumen of the ER. 

Within the Endoplasmatic Reticulum, Binding Proteins such 
5 as BiP are present. These proteins tend to bind to 
hydrophobic stretches of the developing linear protein 
molecule. It is believed that this binding, which normally 
assists the folding of homologous proteins, leads to 
folding of protein molecules in such a way that their 
10 extracellular secretion from the host cell is much less 
efficient. 

- It is further believed that if a sequence of the mRNA 
■ encoding amino acids that will undergo glycosylation is 

15 translated prior to a sequence that encodes a hydrophobic 
stretch, this will result in- an shielding effect being 
exerted on the- hydrophobic stretch by the glycosylation. 
group. This shielding is believed to reduce the binding 
affinity between binding proteins and the hydrophobic 

20 areas. By reducing the binding proteins that attach torthe 
newly translated protein sequence the normal , course , of 
protein folding is prevented and thus the heterologous 
protein molecules are believed to retain a more linear 
conformation. 

25 - . - . 

As indicated above, it is believed that the glycosylation 
groups are suitably present, in part of the protein that is 
translated in the Endoplasmatic Reticulum prior to the 
hydrophobic stretch. To achieve this, the glycosylation 

30 site of the protein is located between the N-terminus of 
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the protein and the hydrophobic stretch of the protein. If 
more than one hydrophobic stretch is present in the 
protein, the glycosylation groups are preferably located 
between the N-terminus of the protein and the first 
5 hydrophobic stretch (i.e the hydrophobic stretch closest to 
the N-terminus) . Even more preferably a glycosylation 
groups is located prior to each hydrophobic stretch i.e. . 
the first glycosylation group between the N-terminus of the 
protein and the first hydrophobic stretch, the second 
10 glycosylation group between the first and the second 
hydrophobic stretch and so on. 

In addition to adequate shielding of the protein in the . 
Endoplasmatic Reticulum, it is also preferred that the 
15 location of the glycosylation groups is chosen such that 
they do not materially affect the functionality of the : 
protein once it has been secreted. 

Once secreted a protein normally is present in a specific 
20 three dimensional (3D) form. The functionality of a 3D 
structure of a protein is normally determined by the fact 
whether the binding sites and/or the active centres of the 
protein are inhibited in their functioning or not. 

25 Applicants therefore believe that the location of the 
glycosylation groups in the protein should preferably be 
chosen such that the functionality of the binding site(s) 
and or the active centre (s) of the protein is not 
inhibited. 
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Applicants believe that this, can be achieved by ensuring 
that the glycosylation groups are located at an adequate 
distance from said binding sites and/or active centres. In 
one embodiment this can be achieved by ensuring that the 
5 distance between the glycosylation sites and the binding 
site(s) and/or active site(s) is greater than the. length of 
the glycosylation group. For example if the size of the 
glycosylation group is about 10 Angstrom, said distance 
should be at least 10 Angstrom, more preferred more than 12 
10 Angstrom or even more than 15 Angstrom. 

In another embodiment the distance between the binding 
sites and/or, the active sites on the one hand and .the 
glycosylation site at the other side can be less than the 

15 size of the glycosylation group. Under those circumstances, 
however, the orientation of the glycosylation should be 
chosen such that it cannot interact with the binding sites 
and/or the active sites. For example if the active site is 
located at the inside of a 3D protein, the glycosylation 

20 site may for example be located=at the outer. surface .of the 
protein at a distance of less. than the size of the : 
glycosylation group to the active site, provided that the 
glycosylation group is at least partially directed away 
from the active site. For example said glycosylation group 

25 may point away from the surface of the protein and hence 
avoid interaction with the active site. 

In a- very advantageous embodiment the invention is applied 
to proteins having a generally roundish, egg- or - 
30 ellipsoidal shape. Examples of such proteins are for 
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example lysozymes, cutinases and antibodies, especially 
variable fragments of antibodies. In these roundish shapes 
generally one. binding area is present at one end of the 
ellipsoid (hereafter referred to as the upper end) . The 
5 active site often is present near the binding site e.g. 
located at the inside of ellipsoid close to the binding 
site. 

Applicants have now found that in elliposoidal proteins, 
10^ preferably the glycosylat ion groups are located at the 

lower end of the ellipsoid. Preferably the distance between 

the glycosylation site and a hypothetical plane separating 
. the binding site from the rest of the protein is more than 

10 .Angstrom, -more preferred more than 12 or even more than 
15 15 Angstrom. 

Applicants believe that based on the above teaching it will 
be well within the ability of the skilled person to design 
the appropriate location of the glycosylation groups. For. 
20 illustrating the invention more clearly, however the «, 

principles of the invention will be illustrated based on a 
number of actual embodiments. 

Embodiment 1: cutinases 

25 

WO 94/14963 discloses various mutants of cutinase. As can 
be seen from figure 11 of this document the hydrophobic 
mutations of the cutinase are preferably located in a 15 
Angstrom band around the Ca-117 plane. Also it is known 
30 that the active site of cutinase is located above the 
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hydrophobic band in the Ccc-120 direction, i.e. at the top 
of the ellipsoid. Based on the above teaching it will be 
clear to the skilled person that in order to maintain the 
functionality of the protein after glycosylation, the 
5 preferred location for glycosylation is the area below the 
above described hydrophobic, band in the Ca-115 direction 
i.e. the bottom of the ellipsoid. These modifications of 
the protein are illustrated in the examples. 

10 Additionally the location of the glycosylation group should 
preferably be chosen such that strength of the BiP binding 
to the linear protein is reduced in the Endoplasmatic 
reticulum.' As explained above this can advantageously be 
' achieved by introducing a glycosylation group between the 

15 N-terminus of the protein and the first hydrophobic 
stretch. 

For example a well-known hydrophobically modified cutinase 
is CY028 as described in G.M.J. Sagt et al, < 1998 ). This 

20 enzyme has been made hydrophobic by the following 

mutations: G82A, A85F, V184I, A185L, L189F. This results in 
the introduction of two hydrophobic stretches, and hence 
the formation of a hydrophobically modified protein, having 
an aggregated BiP score which is about 80 units above the 

25 BiP score for the corresponding wild-type cutinase. As will 
be shown in the examples the introduction of these two 
hydrophobic stretches indeed results in a significant 
decrease in secretion efficiency. 



WO 00/05389 



PCT/EP99/04919 



16 



To restore the secretion efficiency in accordance with the 
present invention, the skilled person would hence try to 
introduce a glycosylation site which satisfies the 
following criteria: 
5 1. The glycosylation site is located between the N- 

terminus of the protein and the first hydrophobic 

stretch . 

1. In the 3D structure the distance between the 

glycosylation site and the binding and/or active 
10 sites is sufficient to avoid ^interaction of the. 

glycosylation group with said binding and/or active 
sites . 

The 3D structure of '-this protein is given in figure 1. ' ' * 
15 Figure I shows the 3D structure of cutinase CY028. As can' 
be seen/ the protein is generally ellipsoidal in shape. The 
longest axis of the representation corresponds to about 40- 
4 5 Angstrom. The binding domain (1) is located at the upper 
end of the representation, the active site (3) is located 
20 at the inside of the protein near the binding domain. Amino 
acid 29 is located at (2) . 

Suitably therefore the glycosylation groups can be located 
at the lower end of the ellipsoid e.g. below plane (4) 
25 which is located about 10 Angstrom downwards from the 
active site. 

It is well-known in the art that to introduce glycosylation 
during the production of a protein an acceptor group is 
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needed which can either be Asn-X-Ser or Asn-X-Thr (where X 
is any amino acid except proline) . 

Therefore in order to ensure that adequate glycosylation 
5 takes place in accordance to the invention, the skilled 
person will seek possible mutations for the cutinase, which 
result in the availability of an additional acceptor group 
which can act as glycosylation site, said acceptor group 
being located before the first hydrophobic stretch and said 
10 receptor group in the 3D structure being located below 
plane (4) of figure 1. 

The amino acid sequence of the cutinase gene is given in..-; 
WO 94/14963 in figure ID. From this it can easily be worked 

15 out that the A29S mutation' (numbering starts at begin of : 
the pro-sequence) leads to the introduction of a 
glycosylation site. This glycosylation site glycosylation 
in accordance to the invention, since this is both located , 
before the first hydrophobic. stretch and also located at 

20 the lower end of the 3D structure (evidenced by (2) in 
figure 1) . 
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Embodiment 2; antibody (fragments) 

Already at a very early stage during evolution, antibodies 
5 have been developed to protect the host organisms against 
invading molecules or organisms. Most likely one of the 
earliest forms of antibodies must have been developed in 
Agnatha. In these primitive fishes, antibodies of the IgM i 
type consisting of heavy and lights chains have been 
10 detected. Also in many other forms of life ranging from 
amphibians to mammals, antibodies are characterized by the 
feature that they consist of two heavy and two light 
chains, although the heavy chains of the various classes of 
immunoglobulins are quite different. These heavy and light 1 
.15 chains interact with each other by a number of different *■ 
physical forces, but interactions between hydrophobic 
patches present on both the heavy and light chain are 
always important. The interaction between heavy and light 
chains exposes the complementarity determining regions ■ 
20 (CDRs) of both chains in such a way that the immunoglobulin 
can bind the antigen optimally. 

Heavy and light chains are composed of constant and 
variable domains. In the organisms producing immunoglobu- 

25 lins in their natural state the constant domains are very 
important for a number of functions, but for many applica- 
tions of antibodies in industrial processes and products 
their variable domains are sufficient. Consequently many 
methods have been described to produce antibody fragments, 

30 but none of these have been particularly successful for 
large scale production at low costs. 
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European patent application EP-A1 584 431 (C. Casterman 
and R. Hamers) discloses the isolation of animal -derived 
immunoglobulins devoid of light chains (also indicated as 
heavy chain immunoglobulins) , which can especially 
5 originate from animals of the camelid family (Camelldae) . 
These heavy chain immunoglobulins are characterized in that 

- they comprise two heavy polypeptide chains sufficient f or - 
the formation of one or more complete antigen binding 
sites, whereby a complete antigen binding site means a site 

10 which alone will allow the recognition and complete binding 
of an antigen, which can be verified by any known method 
regarding the testing of the binding affinity. The European 
patent specification further discloses methods for 
isolating these heavy chain immunoglobulins from the serum 

15 of Cornel idae and details of the chemical structure of these 
heavy chain immunoglobulins . It also indicates that these * 
heavy chain immunoglobulins and derivatives thereof can be 
made by using recombinant DNA technology in both : 
prokaryotes and eukaryotes . Due to the absence of light ; 

20 chains in most of the immunoglobulins of Camelldae linkers 
are not necessary. The majority of the protein A-binding , 
immunoglobulins of Camelidae consists just of two heavy 
chains and that these heavy chains are quite different from 
common forms of heavy chains, as the C H 1 domain is replaced 

25 by a long or short hinge. 

The above illustrates that antibodies are proteins of high 
potential importance for industrial application. However, 
many of these applications cannot be realised because of 
30 the higlv costs of antibodies if produced by culturing 
cells. 
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Applicants have found that the present inventions can 
advantageously be applied to improve the secretion of 
various types of antibodies and fragments thereof. 
Especially the invention can advantageously improve the 
5 secretion of antibody fragments containing one or more 
variable domains. Examples of suitable antibody fragments 
are ScFv, V H , V L , and V^,. 

L.G.J. Frenken et al (1994) describes the production of 
10 ScFv, V H and V L Antibody fragments in Saccharomyces 

cerevislae. It was found that antibody fragments such as 
ScFv-LYS, V H and V L can be produced in yeast but tend to 
accumulate in the Endoplasmatic Reticulum and hence have a 
low secretion efficiency. It is postulated that the v 
15 secretion might be hampered by- the formation of large 
aggregates, due to illegitimate interactions of the 
hydrophobic regions on the V H and V L fragments. 

Applicants have now found that the secretion of antibodies 
20 or antibody fragments, in particular ScFv fragments or V H 
and V L fragments can markedly be improved if the antibody • 
(fragments) are glycosylated at specific locations. 

As described above the location of the glycosylation groups 
25 is chosen such that 

1. in the linear form of the protein the glycosylation 
site is located between the N-terminus of the protein 
and a hydrophobic stretch of the protein; and 
(b) in the 3D form the glycosylation groups are located 
30 such that they do not inhibit the functionality of the 
antibody fragments. 



WO 00/05389 



PCT7EP99/04919 



21 



With respect to the location of the hydrophobic stretch in 
the linear protein, Example III describes the linear amino 
acid sequence of antibody fragments R2, R5 and R7 and 
5 replacement mutations thereof. In accordance to the 
- invention the. first requirement is that the glyjcqsylation _ 
site is located between the N-terminus and the first 
hydrophobic stretch. 

10 The second requirement is that the location of the 

glycosylation site is chosen such that the functionality of 
the antibody (f ragment ) . is not materially affected. In this 
respect reference is made to figure 2 which shows the. 3D . 
structure of the R2 antibody fragment (production described 

15 in example III) . 

In the 3D structure it can be seen that the 3 CDRs *(1) as 
indicated in black are located at the upper end of the 
antibody fragment structure- Also it can be seen that a 

20 plane (2) substantially passing through amino acids 25,- 
102, 94, 34, 51 and 57 (notation in accordance to Kabatt) 
separates the CDR area from the framework of the antibody 
fragment. Hence in accordance to the invention it is 
preferred that any glycosylation group in the 3D structure 

25 either has a distance to plane (2) which is greater than 
the size of the glycosylation group. For example if the 
size of the glycosylation group is about 10 Angstrom, the 
distance should be at least 10 Angstrom, more preferred 
more than 12 Angstrom or even more than 15 Angstrom. 

30 Alternatively the distance to plane (2) is smaller than the 
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size of the glycosylation group, but then the glycosylation 
group at least partially points away from said plane in 
order to avoid interaction of the glycosylation group with 
the CDRs (1) . 

5 

As described above, in order to introduce glycosylation 
during the production of a protein an acceptor group is 
needed which can either be Asn-X-Ser or Asn-X-Thr or 
possibly Asn-X-Cys (where X is any amino acid except 
10 proline) - - - - - • - 

Therefore in order to ensure that adequate glycosylation' 
takes place in accordance to the invention, ; the skilled 
person will seek possible mutations for the antibody 
15 fragment, ..which results in the availability of an 

additional acceptor group which can act as a glycosylation- 
site, said acceptor group being located before the first 
hydrophobic stretch, and said acceptor group in the 3D 
structure being located below plane (2) of figure 2. 

20 

It is believed to be well within the ability of the skilled 
person based on the above teaichings to select the 
appropriate mutation sites in the antibody fragments. 

25 For example it can be expected that the introduction of a 
glycosylation site at amino acid 11 (3) or the introduction 
of a glycosylation site at amino acid 17 (4) will indeed 
lead to a better secretion due to the fact that the 
glycosylation site is located before the hydrophobic 

30 stretch and also substantially below plane (2) . 
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Glycosylation at amino acids 60 (5), 81 (6) or 82b(7) are 
borderline cases, because although in the linear sequence 
the location of the glycosylation groups would be indeed 
5 between the N-terminus and the hydrophobic stretch, figure 
.2 clearly illustrates that these amino acids are located 
fairly close to plane (2) and hence the glycosylation 
groups may or may not interact with the CDR regions, 
dependant on the orientation of the glycosylation groups. 

10 

Similarly for ScFv fragments, being associated complexes of 
a V H and a V L domain (production see L.G.J.. Frenken et al 
(1994))- The structure is believed to be remarkably similar 
to the stucture of R6 of Figure 2, however now a V L 
15 fragment is associatively bonded to the V H domain around 
amino acid number 60 (numbering in accordance to Kabatt) ; . 

The amino acid sequence of the Vh fragment is for example 
given in Example III. The amino acid sequence of the light 
20 chain is as follows: 

10 20 30 40 50 

....|....|....| |. ...|. ...I. ...I. ...I. ...I-..- I 

Dl . 3 DIELTQSPASLSASVGETVTITCRASGNIHNYIAWYQQKQGKSPQIXVYY 

25 

60 70 80 90 100 

. ... t .... I I I I.... I. ...I I I. --.I 

Dl . 3 TTTLADGVPSRFSGSGSGTQYSLKINSLQPEDFGSYYCQHFWSrPRTFGG 

30 . . . . I . . . 

D13 GTKUEIKR 



The sequence of a possible peptide linker is as follows: 

35 



WO 00/05389 



PCT/EP99/04919 



24 

GGGGSG6GGSG666S 

Based on this structure and the considerations as explained 
5 above, the skilled person would expect that the 

introduction of a glycosylation site in the V H part by 
mutation of the 11 position (LllN f A13S, P14A) or a 
mutation at the 17 position (S17N) would lead to improved 
secretion . 

10 

Based on the above-presented reasoning glycosilation at 
amino acid positions 108(8) (T108N) or 110(9)" (T110N) in : 
the V H part, however is likely not to result in the desired 
effect . 

15 

Glycosylation at amino acids 81 (K81N and N82aS) or 
82b(S82bN) or 5 (Q5N) in the V H part are borderline cases, 
because, although in the linear sequence the location of 
the glycosylation groups would be indeed between the N- 
20 terminus and the hydrophobic stretch, these amino acids are 
most likely located fairly close to the CDR area and hence 
the glycosylation groups may or may not interact with the 
CDR regions, dependant on the orientation of the 
glycosylation groups. 

25 

Additionally the following mutations in the V L part are 
believed to be successful: at the 5 position T5N, at the 10 
position S10N, at the 12 position S12N, at the 18 position 
T18N. 
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Applicants believe that based on the above teaching the 
skilled persons will be able to improve the secretion of 
other relatively hydrophobic proteins. Especially good 
results are expected for the production of antibodies or 
5 antibody fragments, in particular antibodies which are 
devoid of light chains or fragments thereof. 

The invention also involves the provision of novel 
glycosylated proteins comprising at least one glycosylat ion 
10 group and at least one hydrophobic stretch, said 

glycosylation group being located between the N-terminus of 
the protein and the hydrophobic stretch, and wherein in the 
3 dimensional form of the protein, 

1. the distance between the glycosylation site and 
15 the binding site (s) and/or active site(s) is 

greater than the length of the glycosylation 
group; or 

(b) the distance between the glycosylation site and 
the binding site(s) and/or active site(s) is less than 
20 the length of the glycosylation group, provided the 

orientation of the glycosylation group is such that it 
cannot interact with the binding site(s) and/or the 
active site(s) of the protein. 

25 The invention further comprises gene sequences capable of 
encoding this protein. Preferably said gene sequence 
comprising at least one partial sequence encoding a 
hydrophobic stretch in the protein and a second partial 
sequence encoding a glycosylation site located between the 

30 N-terminus of the protein and the hydrophobic stretch. The 
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invention also relates to vectors comprising such a gene 
sequence as well as a genetically modified eukaryotic cell, 
comprising capable of producing the proteins of the 
invention or comprising this vector. 

5 

Furthermore the invention relates to consumer products, for 
example food products or products for cleaning or treatment 
of fabrics, the human body or hard surface, said consumer 
products comprising a protein in accordance to the 
10 invention. 

The invention Will be further illustrated in the examples.- 
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Example I 

Wild type (CY000) , and mutant forms of Fusarium solani plsi 
cutinase were expressed in Saccharomyces cerevislae (strain 
5 CEN. PK 111-32D) using the vectors and methods as described 
in.Sagt_CM et al, (1998) and in van Gemeren I. Et al 
(1995) . The mutations were introduced by Polymerase Chain 
Reaction (PCR) as described in Hedstrom L, et al (1991) . 
Standerd techniques of molecular cloning were used as 
10 described in Sambrook,. J. , Fritsch, E.F. and Maniatis 
(1989). 

A mutant cutinase was produced (CY047) with an A29S 
mutation. This mutation introduces a glycosylation site, 
15 into the cutinase. 

. A mutant was produced (-CY028) with G82A, A85F, V184I, 
.A185L, L189F mutations (as described in WO 94/14963) This 
modification results in a significantly increased 
20 hydrophobicity and increased binding affinity with respect 
to the binding protein (BiP) and hence a significantly 
higher BiP score. The aggregated BiP score of said protein 
is about 80 units higher than the BiP score of the 
corresponding wild-type. 

25 

A mutant (CY181) was produced with G82A, A85F, V184I, 
A185L, L189F mutations as well as a A29S mutation- These 
mutations introduce both a glycosylation site and 
hydrophobicity into the protein. The glycosylation site is 
30 located between the N-terminus of the cutinase and the 
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first hydrophobic stretch. Furthermore, the glycosylation 
site is located in the 3D structure at the lower end as 
illustrated in figure 1. 

5 A mutant (CY182) was produced with G82A, A85F, V184I, 

A185L, L189F mutations as well as a R211N mutation. In this 
mutant the glycosylation site is located between the C- 
terminus of the cutinase and the hydrophobic stretch. 

_10. S_. _ cerev.islae. .transf ormants_ expressing _w.ild-ty.pe -cutinase— 
and the mutants were each grown overnight in YP 2% glucose 
/. (YP = 1% Difco yeast extract, 2% bacto peptone) and diluted 
1:10 in YP 5% galacotose to induce cutinase production. . . 
After 24 hours, samples of 1 ml were taken and centrifuged 
15 for 1 min at 14,000 X g to pellet the yeast cells. The v ■ 
amount of secreted cutinase in the medium was determined 1 
with the para-nitophenyl butyrate (PNPB, Sigma) assay as 
described in Kolattukudy et al (1981). The results are 
shown in figure 5 . 
20 - . . 

The wild-type and the CY047, CY028 and CY181 mutant 
cutinase expressing cells were also grown under the same 
conditions, whereby the 5% galactose solution also 
contained 5 (ig/ml tunicamycin to inhibit glycosylation. The 

25 results are shown in figure 6. 

These results clearly show that glycosylation of the Wild 
type cutinase does not materially affect the amount of 
secreted cutinase. The introduction of hydrophobicity, 
however, greatly reduced the secretion of cutinase (compare 

30 Wild-type and CY028) . The introduction of glycosylation, 
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however significantly restored the secretion efficiency 
(compare CY028 and CY181) . The reason for this restored 
efficiency is indeed the presence of glycosyiation before, 
the hydrophobic stretch. This is, clearly shown in the 
5 experiments where glycosyiation is inhibited by tunicamycin 
leading to a significant reduction of secretion efficiency 
for the CY181. It can be concluded that CY182 cutinase with 
the glycosyiation site at the C-terminus is not as 
efficiently secreted as CY181. This indicates that the site 

10 of glycosyiation is more powerful in enhancing the 

secretion if it is located before the hydrophobic stretches 
than when it is located after the hydrophobic stretches. 
This could be due to the decreased affinity for BiP of 
CY181. , . v ". 

15. . . * 

Example II 

Construction of Plchia pastoris strains carrying cutinase. 
genes and the extracellular production of cutinase and 
20 cutinase mutants by this lower eukaryote . 

The Pichla pastoris strain GS115 <his4, Mut + ' Invitrogen, USA ) 
was used. Cutinase variant genes were excised from the S. 
cerevisiae expression plasmids described above by digestion of 

25 the plasmid DNA with Sad and ffindlll. The cutinase genes were 
ligated into the Sacl/Hindlll vector fragment derived from a 
modified pBR322 (Clonetec, USA) vector, that contained a 
linker with a BamHI, Sad, Hindi 1 1 and an EcoRI site in that 
order inserted at between the BamHI and £coRI sites in pBR322. 

30 The cutinase encoding fragment released from this intermediate 
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vector by digestion with BamHl and EcoRl was ligated into the 
expression vector fragment of. pPIC9 after digestion with BamHI 
and EcoRI . This placed the cutinase encoding sequences, linked 
to the SUC2 secretion signal sequence under the control of the 
5 AOX promoter of pPIC9. Before transformation, the plasmid was 
linearized with Aatll. 

For transformation of Plchia pastoris , Pichia pastoris GS115, 
cells were grown overnight in YP 2% glucose in shake flasks at 

10 30°C. 0.1-0.5 ml of the overnight culture was inoculated in 
100 ml of fresh YP 2% glucose (to an ODeoo of approximately 
0,3) and grown o/n at 30°C until the OD 6 oo reached 1,3-1,5. The 
cells were centrifuged at 4000 x g for 5 min. at 4°C and 
washed twice in 100 and 50 ml ice-cold sterile water and in 4 

15 ml ice-cold 1M sorbitol. After centrifuging (4000 x g, '5 min/, 
4°C) the pellet was resuspended in 1 ml ice-cold 1M sorbitol. 
75 ul of cells were mixed with 12.75 ug of linearized DNA 
respectively and transferred to an ice-cold electroporation • 
cuvette (E. coli pulser cuvette, Bio-Rad) and the cuvette with 

20 the cells was incubated on ice for 5 min. The cuvette was 
transferred to a Bio-Rad gene pulser and the cells were 
transformed by application of a pulse of 1.5 kV, 25 u.F and 400 
Q. After the pulse the cells were immediately transferred into 
800 ul YP 2% glucose at 30°C and were incubated for one hour 

25 at 30°C. The cells were washed in 1M sorbitol and thereafter 
resuspended in 200 ul sorbitol. The cells were plated on YNB 
1% glucose plates ( YNB= Yeast Nitrogen Base without amino 
acids (a.a.), Difco) , and the plates were incubated at 30°C. 
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P. pastoris transf orraants containing expression cassettes for 
either CY000, CY028 or CY181 cutinase mutants (see example I) 
that were able to grow on YNB 1% glucose plates after 
electroporation were inoculated in 10 ml of BMG (lOOmM 
5 potassium phosphate buffer, pH 6.0, IX YNB without a. a., 1% 
glycerol) in a 400 ml shake flask and grown at 30°C at 200 rpm 
until the culture reached an OD 6 oo of 2-6. The cells were 
harvested by centrif ugation at 4000 X g for 5 min. at room 
temperature. The supernatant was decanted and the cell pellet 

10 was resuspended in 50 ml BMMY (lOOmM potassium phosphate 
buffer, pH 6.0, IX YNB w/o a. a., 1% Difco yeast extract, 2% 
bacto peptone 0.5% methanol) to induce expression of the 
cutinase variants. Cells were grown in a 400 ml shake flask at 
30°C at 200 rpm. To maintain induction, 250 |il of 100% 

15 : Methanol was added every 24 hours. Every 24 hours, just before 
adding the methanol, 1 ml of sample was taken. The sample was. 
centrifuged .for 1 min. at 14000 X g and extracellular cutinase 
levels were determined by activity assays with p-nitrophenyl 
butyrate as substrate (PNPB, Sigma) as described above. The 

20 cells and the supernatants were stored at -20°C for further 
analysis . 

In order to compare the differences in secretion between the 
CY000, CY028 and CY181 mutant cutinases, the maximum amount of 

25 extracellular cutinase obtained at the end of log phase growth 
was divided by the OD 600 at that time-point. This results in 
the maximum cutinase secretion in mg/1 per OD 6 oo* 4 different 
CY000 Mut + and 4 different CYO00 Mut s transf ormants were 
studied to obtain these values. The same was done for the 

30 CY028 samples except that in this case only 3 different CY181 
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Mut + and 4 different CY181 Mut s trans formants were analysed. 
There is little difference in secretion between Mut + and Mut 
strains; the maximal amount of secreted cutinase did not 
differ significantly between Mut + and Mut s . 

5 

The average relative secretion of CY028 and CY181 cutinase 
compare to GY000 secretion was calculated. When the 
secretion of CY000 is set at 100%, CY028 is secreted at 41% 
and CY181 is secreted at 79%. This again shows that, 
10 compared to CYOOO, the hydrophobic transformant CY028 is 
thus significantly less well secreted. However, secretion 
can be restored to an acceptable level by introducing of 
the glycosylation group in CY181 . 

15 A P. pastoris CY181 transformant was also grown under -the 
same conditions, but with the addition of 10pg/ml 
tunicamycin in the BMMY medium to prevent glycosylation. 
This lead to secretion that was significantly lower than ■ 
the secretion of CY181 grown in the absence of tumicamycin 

20 and more similar to the level of secretion of CY028. • 

Example III 

Cloning of -the variable domain of lama antibodies (HC-V) 
25 raised against the azo-dye RR6 and improved secretion of 
these HC-V s by S, cerevxsiae 

This example refers to the production of single domain 
heavy chain antibody fragments where improved secretion 
30 yield can be achieved in yeast by genetic mutation at 
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specified positions in the antibody fragment genes so as to 
introduce amino acids that are capable of acting as 
glycosylation sites. 

5 a) Induction of humoral immune responses in llama. 

Male llamas were immunized with a water in oil emulsion : 
(1:9 V/V, antigen in water: specol, subcutaneously and 
intramuscularly. 300 pg of the azo-dye Reactive Red-6 
10 antigen (Procion Rubine MX-B, ICI) coupled to bovine serum 
albumin via its reactive triazine group, were inoculated-: 
per immunization site in 0.75-1.5 ml water* / oil emulsion.. 

Further immunizations were performed at 3 weeks and 5 weeks 
15 from the date of initial immunization. 

b) Isolation of gene fragments encoding llama HC-V domains . 

A 200 ml blood sample was then taken from the inoculated 
20 llamas. and peripheral blood cells were obtained . via . a 

Ficoll-Paque centrif ugation (Pharmacia) . The total RNA was 
isolated from about 10 7 lymphocytes essentially as 
described by Chomczynnski and Sacchi (1987). Following cDNA 
.synthesis (with the Amersham first strand cDNA kit, 
25 Amsersham- Pharmacia, UK), DNA fragments encoding HC-V 

fragments and part of the long or short hinge region were 
amplified by a PCR using the specific primers: 

PstI 

30 V H - 2B 5 1 -AGGTSMARCTGCAGSAGTCWGG-3 1 
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S = C and G, M = A and C, R = A and G, W = A and T, 

Lam-07 Hindi I I 
5 5 1 -AACAGTTAAGCTTCCGCTTGCGGCCGCGGAGCTGGGGTCTTCGCTGTGGTGCG-3 1 

Lam-08 Hindi I I 

5 1 -AACAGTTAAGCTTCCGCTTGCGGCCGCTGGTTGTGGTTTTGGTGTCTTGGGTT-3 1 

10 The PGR reaction was carried out for 32 cycles of 1 mih 94 
°C, 1.5 min at 55 °C and 2 min at 72 °C. 

• Upon digestion of the PCR fragments with PstI and BstEII, 
the DNA fragments with a length between about 300 and 450 

15 base pairs encoding the HC-V domain, but lacking the first ■ 
three and the last three codons, were purified via gel 
electrophoresis (using a Quiaex DNA isolation kit, 
Quiagen) . 

20 c) Construction of S. cerevxsxae episomal expression 
plasmids encoding llama HC-V domains . 

Saccharomyces cerevlsiae episomal expression plasmids 
pUR4547 and pUR4548 were derived from pSYl (Harmsen et al . , 
25 1993) and have been deposited at the Centraal Bureau voor 
Schimmelcultures, Baarn deposition numbers: CBS 100012 and 
CBS 100013, respectively. See restriction maps of figures 3 
and 4 . 
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Both plasmids contain the GAL 7 promoter and PGK 
transcription terminator sequences as well as the invertase 
(SUC2) signal sequence. The signal sequence is followed by 
the first 5 codons of the HC-V domain (including the PstI 
5 site), a stuffer sequence which can be removed by digestion 
with Pstl/BstEII, then the last six codons of the HC-V 
domain. 

In pUR4547, these six codons are followed by two stop. 
10 codons, an Aflll and a Hindi I I site. In pUR4548, these six 
codons. are followed by eleven codons encoding a myc epitope 
-tag, two stop codons, an Aflll and Hindlll site. 

A vector fragment was obtained from pUR454 8 by digestion 
15 with PstI and BstEII restriction endonulceases, and ligated 
with the approximately 300-450 bp PstI - BstEII fragments 
obtained as described in (b.) The effect of this is to 
create a genetic library where fragments encoding 
particular HC-V domains are contained within a single 
20 plasmid. 

After transformation of S. cerevislae (Finlayson S,D., et 
al 1991) transformants were selected from minimal media 
agar plates (comprising 0-7 % Difco yeast nitrogen base, 2 
25 % glucose and 2 % agar) . 

d) Screening for antigen specific HC-V domains. 



For the expression and secretion of llama HC-V fragments 
30 linked to the myc-tag, individual transformants containing 
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plasmids derived from pUR4548 were grown overnight in 
selective minimal medium (0.7 % yeast nitrogen base, 2 % 
glucose) and subsequently diluted ten times in YPGal medium 
(1 % yeast extract, 2 % bacto peptone and 5 % galactose) . 

5 

After 24 and 4 8 hours of growth, the culture supernatants 
of the transf ormants were analysed by enzyme linked 
immunosorbent assay (ELISA) for the presence of HC-V 
fragments that specifically bind to the antigen RR6. For 

10 the EL ISA , RR6 was covalently coupled to covalink plates 
(NUNC, 478042) and after incubation with the culture 
supernatants, bound antibodies were detected using the 

. . anti-myc monoclonal antibody 9E10 (ATCC) and a polyclonal 
; goat-anti-mouse-HRP conjugate (Bio-Rad, 172-1011) . 

15 

In this way, a number of anti-RR6 HC-V fragments were 
^ isolated, among which are those referred to as R2, R5 and 
R7 . These fragments have the following amino acid 
sequences. 

20 
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10 20 30 40 

! I I I I I -abc. - . | I I . . 

R2 QVQLQE S GGGLVQAGG S LRlt S CAAS GRAT S GHGH YGMGWFRQVPGKERE F 
5 R5 QVQLQE S GGGLVQAGGSLRL S CAAS GRT S H G YGGYGMGWFRQVPGKEREL 
R7 QVQLQESGGGLVQTGDSLRLSCAASGRTSHGYGGY(adGWFRQIPGKEREL 

50 60 70 80 90 

- ... | . .a. ..| ..... | ..... | ..... I .... I .... I - .abc. . | ... . | . . . 
10 R2 VAAI RWS GKE TWYKD S VKGRFT I SRDNAKTTVYIiQMNS LKPED TAVY Y CA 
R5 VAAIRWSGTSTYYADSVKGRFTISRDNVKNMVYIiQMNSLKPEDTAVYHCA 
R7 VAAIRWSGRNT Y YAD S VKGRFT I S RDNVKDML YXQMD SLKPEDTAVTYCA 



15 " 1. HO 

. | . . . . Jabcdefgh. ... | .... | . . . 
R2 ARPVRVDD I SLPVGFD YWGQGTQVTVS S 
R5 ARTVRWD I S SPVGFAYWGQGTQVTVS S 
R7 VRTVRWD I S S PVGFAYWGQGTQVTVS S 



20 



25 



The gene fragments encoding these anti-RR6 fragments were 
subcloned: from the pUR4548 into pUR4547 resulting in the 
plasmids shown in table 1: 

Table 1: plasmids 



Fragment 


pUR4548 derivative 


pUR4547 derivative 


R2 


pUR4 633 


pUR4 64 3 


R5 


pUR4 636 


PUR5353 


R7 


pUR4 638 


pUR4644 



Construction of S. cerevisiae multicopy integration vectors 
for the expression of llama HC-V domains combines the 
30 benefits of high copy number and mitotically stable 

expression. The concept of a multicopy integration system 
into the rDNA locus of lower eukaryotes and the proven 
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procedures for obtaining such constructs are described by 
Giuseppin et al . 1991 (WO-A-91/00920) . 

5 e) Construction of glycosylation mutants of HC-V(RR6) with 
improved secretion yield 

Of the anti-RR6 HC-V fragments identified in (d) , R2, R5 
and R7 were subjected to genetic modifications to introduce 

10 amino acids that predispose the protein to glycosylation . 
R2 encoded by the plasmids pUR4643 and pUR4633 is 
efficiently secreted by S. cerevlsiae but the total 
production level is low. R5, is well produced but 
accumulates intracellularly if expressed in S. cerevlsiae 

15 from' the plasmids pUR4636 or pUR5353. R7 is produced at 1 a ; 
relatively high level in S. cerevisiae from the plasmids 
pUR4638 or pUR4644 but like R5, the majority of the protein 
remains intracellular . 

20 In total,' six different replacements are described for the 
three different antibody fragments. All mutations were 
performed by making use of the overlap-extension PCR 
technique (Horton et al., 1989). The Amino acid 
replacements, the corresponding nucleotide sequences of the 

25 oligo's used to introduce these mutations and the oligo' 
names are shown in the table 3. 

The plasmids pUR4643 (R2 fragment), pUR4636 (R5 fragment 
for mutations 1, 2, 5, 6 and 7) pUR5353 (R5 fragment for 
30 mutation 3) and pUR4644 (R7 fragment) were used as 
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templates for the DNA modifications in the HC-V fragment 
coding sequences. 

The technique of splicing by overlap PCR extension required 
5 the creation of separate left hand and right hand PCR 
fragments and the subsequent combination of the two to 
generate a coding region for the desired mutations. 

Left-hand (=Upstream) fragments were created by making use 
10 of the 5' oligo' JA-L-gl (table 2) and depending on the 
desired modification, one of the 3' oligos designated with 
an *L' shown in table 2. The first number in the oligo' 
corresponds to the location of the amino acid replacement 
and the last number, or numbers, to the antibody fragment 
15 to : , be. modified. The right-hand (—downstream) fragments were 
created by making use of the 3' oligo JA-R-gl (table 2) and 
the 5' oligo' containing the inverse complement of part of 
the 3' A L' oligo. 

20 Table 2. The 5' and 3' outermost PCR primers 



Oligo' 
Number 


Oligo' Sequence 


JA-R-gl 


TAGCTCACTCATTAGGCACC 


JA-L-gl 


GCCTTTAGCTATGTTCAG 



PCR was performed using the following conditions: 1 min at 
25 95 °C, 1.5 min at 50 °C and 1 min at 72 °C for 25 cycles. 
Subsequently, the resulting PCR fragments were combined by 
performing a PCR reaction with the appropriate left T and 
right-hand fragments as templates and JA-L-gl. and JA-R-gl 
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oligo's as primers. The conditions used were as described 
above . 

The products of these reactions were digested either with 
5 PstI and BstEII (R2 and R5 mutants 1, 2, 5, 6 and 7 and R7 
mutants 1, 2 and 3) or PstI and Hindlll (R2 mutant 3 and R5 
mutant 3 in which the BstEII site is removed by the 
mutation) and the HC-V coding fragment isolated and cloned 
into the pUR4547 vector fragment obtained by digestion with 
10 PstI /BstEII or Pstl/ffindlll as appropriate. 

The expression plasmids so formed are listed in table 3. 

Saccheromyces cerevisiae was transformed with these 
15 plasmids and expression of the HC-Vs induced as described 
above. Samples of the culture supernatant s from these 
transf ormants were analysed by polyacrylamide gel 
electrophoresis (PAGE) using a Multipho II (Amersham- 
Pharmacia) or Mini-Protean (Bio-Rad) system according to 
20 the manufacturers instructions. As glycosylated proteins do 
not bind Coomassie Brilliant Blue protein stain as 
efficiently as non-glycosylated proteins, samples were also 
treated with Endo H (Boehringher ) according to the 
manufacturers instructions to remove the glycosylation . 

25 

The presence of the glycosylation sites had a marked effect 
on the antibody fragments produced. The results for R2 
showed that the secreted forms of the modified HC-V 
fragments were indeed glycosylated as could be seen by an 
30 increase in the apparent molecular weight of the proteins 
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which restored to normal after Endo H treatment. Mutants 
number 1 and 3 appear only to be core glycosylated as they 
show a more discrete band on the PAGE gels compared to the 
other variants which show diffuse higher molecular weight 
5 bands. Glycosylation had no effect on the levels of 
secretion of the R2 derivatives. However, for R5 and R7 
(the poorly secreted proteins) significant improvements in 
the levels of secretion were observed for the mutant R5-1, 
R5-2, and for the R7 variants R7-1 and R7-2 . The mutant 3 
10 derivatives showed no improvement in secretion. This is in 
accordance with the invention as R5-1, R5-2 and R7-1 and 
R7-2 comprise glycosilation sites before the hydrophobic 
stretch. 

Slight improvement was found for mutants R5-6 and R5-7. 
15 . 

These results are summarised in table 4. 
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Table 4 Results of antibody glycosilation experiments 



RR6 binding fragment 


Mutation number 


Secretion 


R2 


1 


+ 




2 


+ 




3 


+ 




5 


+ 




6 






7 


+ 


R5 


1 


+ 




2 


+ 




3 






6 


slight 




7 


slight 


R7 


1 


+ 




2 


+ 




3 
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Claims 

1. A method for producing a protein, preferably a 
heterologous protein, in a lower eukaryotic cell, 
followed by secretion of said protein from said cell, 
said protein comprising at least one hydrophobic 
stretch, wherein the protein is glycosylated such as 
to increase the level of secretion of the protein, 
without substantially affecting the functionality of 
the protein and wherein at least one N-giycosylation 
group is located between the N-terminus of the protein 
and the first hydrophobic stretch and with the proviso 
that 

a) the protein is not a single chain Fv molecule against 
human TfR with an N linked glycosilation site at 
position 19 in FR1 of V H/ and 

b) the protein is not a single chain Fv molecule against 
hapten DNP with an N linked glycosilation site at 
position 19 in FR1 of V H . 

2. A method according to claim 1, wherein the hydrophobic 
stretch of the protein has an individual BiP score of 
10 or more and/or the protein has an aggregated BiP 
score which is at least 50 units above the BiP score 
of the corresponding wild-type protein, 

3. A method according to one or more of the preceding 
claims wherein in the 3 dimensional form of the 
protein, 
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(a) the distance between the glycosylation site and 
the binding site(s) and/or active site(s) is greater 
than the length of the glycosylation group; or 

(b) the distance between the glycosylation site and 
the binding site (s) and/or active site(s) is less than 
the length of the glycosylation group, provided the 
orientation" of the glycosylation group is such that it 
cannot interact with the binding site(s) and/or the 
active site(s) of the protein. 

4. A method according to one or more of the preceding 
claims wherein the protein has a substantially 
ellipsoidal three dimensional shape, wherein the 
binding and/or active site(s) are located at the upper 
end of the ellipsoid and the glycosylation groups are 
located at a substantial distance from said upper end. 

5. A method according to one or more of the preceding 
claims, wherein the protein is a lysozyme, cutinase or 
antibodies, especially variable domains thereof. 

6. A method according to any of the preceding claims, 
wherein the eukaryotic cell is a yeast or a mould. 

7. A method according to claim 6, wherein the yeast 
belongs to the genera Saccharomyces, Kluyveromyces, 
Hansenula or Pichia. 



8. 



A method according to claim 6, wherein the mould 
belongs to the genera Aspergillus or Trichoderma. 
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Glycosylated protein comprising at least one 
glycosylation group and at least one hydrophobic 
stretch, said glycosylation group being located 
between the N-terminus of the protein and the 
hydrophobic stretch, and wherein in the 3 dimensional 
form of the protein, 

(a) the distance between the glycosylation site and 
the binding site(s) and/or active site(s) is greater 
than the length of the glycosylation group; or 

(fc>) the distance between the glycosylation site and 
the binding site(s) and/or active site(s) is less than 
the length of the glycosylation group, provided the 
orientation of the glycosylation group is such that it 
cannot interact with the binding site(s) and/or the 
active site (s) of the protein. 



10. A gene sequence capable of encoding the glycosylated 
protein of claim 9 or the protein obtainable in the 
process according to one or more of claims 1-8. 

11. A gene sequence according to claim 10, comprising at 
least one partial sequence encoding a hydrophobic 
stretch in the protein and a second partial sequence 
encoding a glycosylation site located between the N- 
terminus of the protein and the hydrophobic stretch. 



12. A vector comprising a gene sequence in accordance to 
claim 10 or 11. 
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13. Genetically modified eukaryotic cell, capable of 
producing a protein in accordance to claim 9 or 
comprising a vector according to claim 12 



14 . Consumer product comprising a protein in accordance to 
claim 9 or a gene according to claim 10 or a 
genetically modified eukaryotic cell according to 
claim 13.' 
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A/arI.51 




3437,BspHI I BspHI,3030 



BspHI,3135 



AfUl 

PstI BstEIl Hindlll 

<- MPG 124 RL (47) -> 

CTGCAGGAGTCATAATGAGGGACCCAGGTCACCGTCTCCTCATAATGACTTAAGCTT 

+ + -4 -4- + 

GACGTCCTCAGTATTACTCCCTGGGTCCAGTGGCAGAGGAGTATTACTGAATTCGAA 

<- MPG 125 RL (55) -> 

LQES**GT. QVTVSS**LKl 
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6207,HpaI 



5658,FspI N 



Nar 1,51 
Fspl,72 

' Pvu 1,92 
Pvufl,122 
' Eco RI.21 2 
Sma 1,220 
Sam HI.31 3 
A//7eI,431 
" Sac 1,520 
Fsp 1,605 
fPvun.615 
Psf 1,618 
Bsfti,644 
11,699 
H/ndm,702 
C/a 1,708 



5145,FspI. 



4860,C/aI— 



4439,S«yI-; 
4415,SfyI / 
4378,EcoRV 

4196,SfyI 




Nhel,974 
PvuII,1157 



-SspHI,2055 



3945 



3781 ,C/aI 



3470,BspHI 



espHI.3063 
BspHI,3168 



Fsp 1,2448 

\ 'Pvul,2595 
Sea 1,2706 



Af II 1 

Hindi II 



Pstl BstEll 

<- HPC 126 RL (31) -><- MPG 128 RL (49) -> 
CTGCACGAGTCATAATGAGGGACCCAGGTCACC6TCTCCTCAGAACAAAAACTCATCTCAGAAGAGGATCTCAATTAATGACTTAAGCTT 
_ _ _ _ . _^ . + + + — + - + 

GACGTCCTCAGTATTACTCCCTGGGTCCAGTGGCAGAGGAGTCTTGTTTTTGAGTAGAGTCITCTCCTAGACTTAATTACTGAATTCGAA 

<- HPG 127 RL (45) -><- HPG 129 RL (43) -> 

LQES**GT0VTVSSEQKL1SEEDLN**LICL 

<- myc tail' -> 
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Fig. 5. 
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Cutinase mutant 
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Fig.6. 
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